Function Approximation for Continuous Constrained MDPs

نویسندگان

  • Aditya Undurti
  • Alborz Geramifard
  • Jonathan P. How
چکیده

In this work we apply function approximation techniques to solve continuous, constrained Markov Decision Processes (MDPs). Many real-world robot planning problems are best represented as MDPs with a continuous state space. However, in many scenarios constraints must be accounted for as well. These constraints are treated probabilistically, with a bound on the constraint violation probability. Existing function approximation literature does not account for constraints, and existing constrained MDP work assumes a discrete state space. We seek to bridge this gap by maintaining two continuous value functions one for the reward and one for the constraint violation probability and approximate these value functions by writing them as a linear combination of a set of basis functions (not necessarily the same set for both value functions). We present some simulation results and investigate the impact of choice of basis functions on the quality of policies generated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Solution Methods for Constrained Markov Decision Process with Continuous Probability Modulation

We propose solution methods for previouslyunsolved constrained MDPs in which actions can continuously modify the transition probabilities within some acceptable sets. While many methods have been proposed to solve regular MDPs with large state sets, there are few practical approaches for solving constrained MDPs with large action sets. In particular, we show that the continuous action sets can ...

متن کامل

Symmetric Primal-Dual Approximate Linear Programming for Factored MDPs

A weakness of classical Markov decision processes is that they scale very poorly due to the flat state-space representation. Factored MDPs address this representational problem by exploiting problem structure to specify the transition and reward functions of an MDP in a compact manner. However, in general, solutions to factored MDPs do not retain the structure and compactness of the problem rep...

متن کامل

Bounded Approximate Symbolic Dynamic Programming for Hybrid MDPs

Recent advances in symbolic dynamic programming (SDP) combined with the extended algebraic decision diagram (XADD) data structure have provided exact solutions for mixed discrete and continuous (hybrid) MDPs with piecewise linear dynamics and continuous actions. Since XADD-based exact solutions may grow intractably large for many problems, we propose a bounded error compression technique for XA...

متن کامل

Continuous Discrete Variable Optimization of Structures Using Approximation Methods

Optimum design of structures is achieved while the design variables are continuous and discrete. To reduce the computational work involved in the optimization process, all the functions that are expensive to evaluate, are approximated. To approximate these functions, a semi quadratic function is employed. Only the diagonal terms of the Hessian matrix are used and these elements are estimated fr...

متن کامل

Piecewise Linear Value Function Approximation for Factored MDP

A number of proposals have been put forth in recent years for the solution of Markov decision processes (MDPs) whose state (and sometimes action) spaces are factored. One recent class of methods involves linear value function approximation, where the optimal value function is assumed to be a linear combination of some set of basis functions, with the aim of finding suitable weights. While sophi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011